Optimizing to Arbitrary NLP Metrics using Ensemble Selection

Authors

Art Munson

Claire Cardie

Rich Caruana

Abstract

While there have been many successful applications of machine learning methods to tasks in NLP, learning algorithms are not typically designed to optimize NLP performance metrics. This paper evaluates an ensemble selection framework designed to optimize arbitrary metrics and automate the process of algorithm selection and parameter tuning. We report the results of experiments that instantiate the framework for three NLP tasks, using six learning algorithms, a wide variety of parameterizations, and 15 performance metrics. Based on our results, we make recommendations for subsequent machine-learning-based research for natural language learning.
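
The abstract does not describe the selection procedure itself. As a rough illustration only, the sketch below shows greedy forward selection with replacement over a library of model predictions, which is one common way ensemble selection is formulated. It assumes binary labels, a metric where higher is better, and an ensemble that averages member predictions. The names `ensemble_select` and `f1_at_half` are hypothetical, and stopping at the first round without improvement is a simplification, not the authors' implementation.

```python
from typing import Callable, Dict, List, Sequence


def ensemble_select(
    preds: Dict[str, Sequence[float]],
    labels: Sequence[int],
    metric: Callable[[Sequence[int], Sequence[float]], float],
    max_rounds: int = 20,
) -> List[str]:
    """Greedily add models (with replacement) while the metric improves.

    preds maps a model name to its scores on a held-out selection set.
    The ensemble prediction is the mean of the chosen models' scores.
    """
    chosen: List[str] = []
    total = [0.0] * len(labels)   # running sum of chosen models' scores
    best = float("-inf")
    for _ in range(max_rounds):
        round_name, round_score = None, best
        size = len(chosen) + 1
        for name, p in preds.items():
            # Score the ensemble as if this model were added.
            avg = [(t + x) / size for t, x in zip(total, p)]
            score = metric(labels, avg)
            if score > round_score:
                round_name, round_score = name, score
        if round_name is None:    # no candidate improves the metric
            break
        chosen.append(round_name)
        total = [t + x for t, x in zip(total, preds[round_name])]
        best = round_score
    return chosen


def f1_at_half(labels: Sequence[int], scores: Sequence[float]) -> float:
    """F1 for the positive class, thresholding scores at 0.5."""
    tp = sum(1 for y, s in zip(labels, scores) if y == 1 and s >= 0.5)
    fp = sum(1 for y, s in zip(labels, scores) if y == 0 and s >= 0.5)
    fn = sum(1 for y, s in zip(labels, scores) if y == 1 and s < 0.5)
    return 2 * tp / (2 * tp + fp + fn) if tp else 0.0
```

Because the metric is passed in as a plain function, the same loop can target accuracy, F1, or any task-specific score without changing how the underlying models were trained.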

Similar resources

Ensemble Classification and Extended Feature Selection for Credit Card Fraud Detection

With the rise of technology, the possibility of fraud in areas such as banking has increased. Credit card fraud is a crucial problem in banking, and its danger is ever increasing. This paper proposes an advanced data mining method that considers both feature selection and decision cost to improve the accuracy of credit card fraud detection. After selecting the best and most effec...

Investigating the Habitat Patches of the Baluchistan Black Bear (Ursus thibetanus gedrosianus), Using Landscape Metrics (Case Study: Bahr Asman and Zaryab Areas, Kerman Province)

Habitat analysis using landscape metrics can be efficient in better management of habitat. As a critically endangered subspecies, the Baluchistan black bear is scattered in the Bahr Asman and Zaryab areas in Kerman province. The purpose of this study was to model the distribution of the sub-species and evaluate the quality of its habitat patches, using landscape metrics. Distribution modeling w...

Multi-Metric Optimization Using Ensemble Tuning

This paper examines tuning for statistical machine translation (SMT) with respect to multiple evaluation metrics. We propose several novel methods for tuning towards multiple objectives, including some based on ensemble decoding methods. Pareto-optimality is a natural way to think about multi-metric optimization (MMO) and our methods can effectively combine several Pareto-optimal solutions, obv...

A New Method for Improving Computational Cost of Open Information Extraction Systems Using Log-Linear Model

Information extraction (IE) is the process of automatically producing a structured representation from unstructured or semi-structured text. It is a long-standing challenge in natural language processing (NLP) that has been intensified by the growing volume, heterogeneity, and unstructured form of information. One of the core information extraction tasks is relation extraction wh...

What's in a p-value in NLP?

In NLP, we need to document that our proposed methods perform significantly better with respect to standard metrics than previous approaches, typically by reporting p-values obtained by rank- or randomization-based tests. We show that significance results following current research standards are unreliable and, in addition, very sensitive to sample size, covariates such as sentence length, as wel...

Publication date: 2005
